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Abstract 

A model of self-avoiding walk with suitable constraints on self-attraction is developed to describe 
the conformational behavior of a short RNA or a single stranded DNA molecule that forms hairpin 
structure and calculate the properties associated with coil-hairpin transition by enumerating all 
possible conformations of a chain of N monomers in two and three dimensions. The first and last 
five monomers of the chain have been allowed to pair and form the stem of the hairpin structure 
while the remaining monomers can form a loop. The coil-hairpin transition is found to be first 
order with large entropy change. While the rate of unzipping of the hairpin stem is found to be 
independent of the length of the loop and the dimensionality of the space, the rate of closing varies 
greatly with loop length and dimensionality of the space. 
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The hairpin structure is often observed in RNA and single stranded DNA (ssDNA) 
molecules and is known to participate in biological functions such as the regulation ofgene 
expression, DNA recombination, and facilitation of mutagenic events etc jl, 2], [], J [5)]. 
The stability and conformational fluctuations of the hairpin structure have recently been 
investigated by designing simple ssDNA oligonucleotides , llOj that have few com- 



plimentary bases at both ends of the chain and one type of nucleotides in the middle, e.g. 
5'— CCCAA - (X) m - TTGGG— 3' where X is any one of the four nucleotides and m its 
number. A hairpin structure has two structurally and dynamically distinct domains: a base 
paired stem and a single stranded loop connecting two halves of the stem. The stem shows 
the same response to change in solution conditions as a dsDNA oligomer. The loop region, 
however, shows a wide range of folding patterns that depend on the number and type of 
bases in the loop. The stem-loop structure fluctuates thermodynamically between different 
conformations, which in a simplified description, are divided into two main states: the open 
state when all the binding monomers are separated and the fully closed one where all the 
complimentary bases are paired. However, as shown below, this simplification is not always 
correct. 

The closed-to-open transition requires the large energy to unzip the base pairs of the 
stem whereas the closing transition requires the two arms of the loop come close to each 
other in space in such a way that hydrogen bonding interaction between the complimentary 
nucleotides can take place. The understanding of the nature of the open-to-closed transition 
is essential to our understanding of biopolymer dynamics. 

By attaching donor and accepter flurophores to both ends of a ssDNA the open-to-closed 
conformational dynamics of the hairpin have been investigated 0, Q, [9]. The fraction of 
the open state is displayed in terms of the melting curve, which depicts the variation of the 
static fluorescence intensity with temperature. The melting temperature T m of the structure 
is defined as the temperature where the probabilities of the closed and open states are equal. 
In such experiments it is, however, difficult to tell whether the closed state corresponds to a 
hairpin structure with some base pairs intact [Fig. l(a-c)] or whether it is merely a folded 
state where the donor and accepter are quite close without base pairing [Fig. l(d-f)]. 

Attempts have recently been made to understand the conformational fluctuations of 



hairpin-loop structure in telehelic chain using Monte Carlo simulations 
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12|]. In these 



calculations one or two attractive sites with spherically symmetric interactions have been 
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FIG. 1: Schematic representation of possible configurations of bases of the two ends of a chain on 
reaching close to each other. In (a) all the complementary bases are paired whereas in (b) and (c) 
the pairing is only partial. In situations shown in (d) to (f) pairing cannot take place even though 
two ends of the chain are close to each other. 

attached at both ends of a chain of hard spheres. However, as stated above, in ssDNA 
and RNA the hairpin structure is formed due to pairing of complementary bases through 
hydrogen bonds which are highly directional and therefore, spherical nature of attraction 
taken in these studies does not represent the real situations. 

In this communication we propose a lattice model of self-avoiding walk with constraints 
that mimic the condition of pairing of bases and calculate the conformational behaviour of 
the hairpin-loop structure. A self-avoiding walk of N steps (N+l vertices) is considered on 
a square lattice in two- dimensions (2D) and on a cubic lattice in three dimensions (3D). A 
step of the walk represents a monomer of the chain. With each monomer a base is attached 
which has a direction associated with it. The first n bases of the walk represent nucleotide, 
say A (or C) and the last n bases the complementary nucleotide T (or G). The remaining 
N — 2n = m bases represent one type of nucleotides which do not participate in pairing. The 
repulsion between monomers at short distances (i.e excluded volume) is taken into account 
by the condition of self-avoidance. 

The pairing between bases can take place only when the bases at the two ends of the chain 
representing the complementary nucleotides approach on the neighbouring lattice bonds 
with their directions pointing to each other (see Fig. 1(a)). A base can at most pair with a 
complementary base. The base pairing can not take place when the two bases representing 
complementary nucleotides approach on neighbouring lattice bonds but with their directions 
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not pointing to each other as shown in Fig. l(d-f). In Fig 1(b) and (c) a situation in which 
only partial pairing can take place is shown. 

All possible conformations of ssDNA of N bases mapped by the self-avoiding walks [lc 
with the constraints specified above and having steps N < 27 on a square lattice (in 2D) 
and iV < 19 on a cubic lattice (in 3D) have been exactly enumerated 14j, |l5|. The partition 
function of the system is found from the relation, 

n 

Z N (T)=J2C N me-^ T y (1) 

i=0 

where C^{i) is the total number of configurations corresponding to walk of iV steps with 
i number of intact base pairs. In all the results reported below we have taken n = 5 and 
e = -0.08 eV. 

The partition function defined by Eq. (1) has six terms corresponding to six values of i 
from to 5 (z = corresponds to the open state, 1 < i < 4 to the partially bound state 
and i = 5 to the fully bound state of the hairpin-loop structure). We can calculate the 
probability of these states for a chain of length N from the relation 

P.iT) = f§ (2) 

where Z' N {T) = C N (i)e~^ ksT and Z N (T) = £- = o Z* N . 

Let the probability of the closed state which includes both partially and fully closed 
configurations of the chain is defined as 

K(T) = (3) 

where Z' N (T) = £-.1 Z' N . 

The values of Pi{T) of partially bound states {i.e. for 1 < i < 4) are found to be small 
compared to the values of Po{T) and -Ps(T) but not always negligible. Our results show that 
as the length of the part of the chain that forms loop in hairpin-loop structure increases, 
the values of Pi{T) of the partially bound states decreases and is expected to be negligibly 
small when m » 2n. However, when m < 2n, the contribution due to partially bound 
states are significant and cannot be neglected. The values of Pi{T) is found to peak at some 
temperature and the position of the peak shifts to the lower temperatures as i is increased 
from 1 to 4. The peak height decreases on increasing the chain length. In 2D for chains of 
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FIG. 2: Comparison of probability (Pj(T) of partially closed states (1 < i < 4) for a chain of 
iV = 19 monomers in two and three dimensions, (a) i = 1, 3 and (b) i = 2, 4. 

small length, iV < 21, the peak of Ps{T) is found higher than that of P^{T) (see Fig. 2), but 
for N > 23 the peak of Pa(T) becomes higher than that of P$(T). In 3D we, however, find 
that the peak of Pa{T) is higher than that of P$(T) for all chain lengths studied by us. In 
Fig. 2, we plot the values of Pi(T) found in 2D and 3D for a chain having N = 19 monomers 
to show the temperature dependence and magnitude of Pi{T) of partially bound states. We 
note that both the peak height and the width of Pi(T) are larger in 2D than in 3D. This 
shows that the probability of finding partially bound states is less in 3D than in 2D. This is 
because of the competition between thermal fluctuations which forces the chain to be in the 
open state and pairing of bases which take place when complimentary monomers approach 
each other in a particular way. Since the contribution arising due to thermal fluctuations 
in 3D is higher and the energy gained due to formation of pairs is same in 3D and 2D, the 
partially bound states are less stable in 3D than 2D. 

The values of Pq(T), P${T) and P C {T) as a function of T are plotted in Fig. 3 for chains 
of N = 15 and 27 in 2D and in Fig. 4 for N = 15 and 19 in 3D. The difference in the values 
of P C (T) and P^{T) is due to partially bound states which are included in P C {T) but not in 
P 5 (T). The difference, as expected, seems to decrease on increasing the length of the chain 
and in going from 2D to 3D. The melting temperature T rn {i.e. the temperature at which 
transition from the hairpin-loop state to coil state takes place) is found from the intersect 
of Pq(T) and P C {T) which takes place at their values of 0.5. We list in Table I the values of 
T m for different chain lengths in both 2D and 3D cases. The value of T m is found to have 
strong dependence on the chain length (T m decreases as chain length increases) and on the 
dimensionality of the space. 
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FIG. 3: Comparison of probability Pi(T) of opened (i = 0) [solid line], completely closed (i = 5) 
[dotted line], and sum of partially and fully closed states (X)i=i *) [dashed line] as a function of 
temperature T, in two dimensions, (a) N=15, (b) N=27. 

The Helmholtz free energy and other thermodynamic quantities like the internal energy 
([/), the entropy (S) and the specific heat (C) can be found from the standard relations; 

F = -k B T\nZ N {T) = -k B T(\nZ° N -]nP (T)) (4) 

5 

U = ^ iePi(T) (5) 
i=i 

S = -{^)= k ^ Z N + Y (6) 

— m 

In general,the transition is located from the maxima of C(T) curve. The temperature T c 
at which maximum of specific heat is found, is lower than T m . Using Eqs. (4) and (7) 
and the fact that Z% is temperature independent and P (T m ) = \ one can calculate the 
difference (T m — T c ) for a given chain. We list in Table I the value of T c for several values of 
N and note that as iV increases T c approaches to T m and the peak in C(T) curve becomes 
sharper. The calculated entropy shows a jump from low value corresponding to the hairpin 
structure to a value equal to iVln^ (/i being the connectivity of the lattice for a SAW) at 
high temperatures. 

The jump in entropy at T m is found from the relation 

= U(T m ) 
k B k B T m 



TABLE I: Values of transition temperature and change in entropy at the transition temperature 
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FIG. 4: Same as in Fig. 3, but in three dimensions and for (a) N=15 and (b) N=19. 

We list the value of AS in Table I. The large change in entropy indicates that the 
transition is first order. 

From the principle of detailed balance one gets 

p 

o—c 1 c 

k c -o Pq 

where ki-j is the rate coefficient jumping from i to j state. The rate coefficient is assumed 
to follow the Arrhenius kinetics, k^j = k*_j exp(— (3AFi_j), where AF^j denotes the free 
energy barrier associated with jumping from i to j states and k\_,- is a constant depending 
on the chemical nature of the chain. When the conformation changes from the open state 
to the closed one, the free energy barrier AF Q _ C corresponds only to the entropy loss from a 
random coil to a ring polymer; and hence it should be temperature independent for a given 

7 



(9) 




FIG. 5: \n( L p^ c ) is plotted against j3 for several chain lengths N; (a) in two dimensions with 
L = m + 1 and g = 1.82 and (b) in three dimensions with L = m + 1 and g = 2.15. Slope in both 
cases is equal to the internal energy of the fully bound state. 

chain. On the other hand, when the structure fluctuates from the closed state to the open 
one, the energy barrier is the binding energy i.e. AF C _ ~ U. 

The probability that a polymer of length L folds on to itself to form a ring is given by 



PL 



(10) 



We find the main contribution to pi is due to the term corresponding to formation of one 
pair. As suggested above fc _ c is proportional to pi- For a self-avoiding walk of length L 
one finds 16] 

PL oc L~ 9 (11) 

where g = (dv + 7 — 1). Here d, 7 and v are, respectively, the dimensionality of space, 
susceptibility and correlation length critical exponents. Substituting the value of 7 and v 
one finds g = 1.83 in 2D and 1.97 in 3D. From our results we find that g = 1.82 in 2D and 
g = 2.15 in 3D and L = m + 1. When we compare these values of g with those found from 
Eq. (11) a very good agreement is seen in 2D, but in 3D the value found by us is higher. 
However, the value of g found by Monte Carlo simulation on 3D lattice is 2.15 [nl, Q] and 



the analytical estimates place its value in the range 2.10 < g < 2.20 19|. 
Using Eq. (9) we write 

1 1 P r 1 L 9 P r 



(12) 



k c -o &o-c Pq L 9 k Q _ c Po 
We plot in Fig. 5 ln(^- £ ) with L = m + 1 and g — 1.82 in 2D and 2.15 in 3D as a function 
of P for several chain lengths in 2D and 3D. As can be seen from these figures that the 
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values collapse into a single line with a slope equal to 0.4 ± 0.002. The slope is same in 
both 2D and 3D cases and equal to the value of the internal energy of the fully closed state. 
This result indicates that regardless of the loop length and dimensionality of the space the 
thermal fluctuations provide a probability of exp(—/3U) to unbind the closed conformation. 

We have presented a of self-avoiding walk on a lattice with on-site repulsion and ap- 
propriate constraints on self-attraction to describe the conformational behaviour of a single 
stranded DNA and RNA molecules that form hairpin structure. For a given chain of N 
monomers, we have enumerated all possible configuration of the chain and using this data 
we calculated properties associated with the coil-hairpin transition. The transition is found 
to be first order with large entropy change. The transition temperature (T m ) is found from 
the intersection of the probability corresponding to the open state and the closed state which 
includes both partially and fully closed states. These probabilities are found to intersect at 
their values of 0.5. The transition temperature (T m ) decreases on increasing the loop length 
as has been observed in a real system 6]. The transition temperature, T c found from the 
maximum of specific heat curve has the value lower than T m and the gap between the two 
decreases as the length of the chain increases. The probability of the partially closed states 
are small and decreases on increasing the part of the chain that forms loop. The rate of 
unzipping of the hairpin stem was found to be proportional to exp(—f3U) and independent 
of both the length of the loop and dimensionality of the space. The rate of closing is, how- 
ever, found to depend rather strongly on both the loop length and the dimensionality of the 
space. 
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